NSF PAR Search | NSF Public Access Repository

Protecting Privacy against Membership Inference Attack with LLM Fine-tuning through Flatness

https://doi.org/10.1137/1.9781611978520.41

Chen, Tiejin; Da, Longchao; Zhou, Huixue; Li, Pingzhi; Zhou, Kaixiong; Chen, Tianlong; Wei, Hua (January 2025, Society for Industrial and Applied Mathematics)

Full Text Available
RegExplainer: Generating Explanations for Graph Neural Networks in Regression Tasks

Zhang, Jiaxing; Chen, Zhuomin; Mei, Hao; Da, Longchao; Luo, Dongsheng; Wei, Hua (December 2024, 38th Conference on Neural Information Processing Systems (NeurIPS 2024))

Full Text Available
Shaded Route Planning Using Active Segmentation and Identification of Satellite Images

https://doi.org/10.1145/3627673.3679234

Da, Longchao; Chhibba, Rohan; Jaiswal, Rushabh; Middel, Ariane; Wei, Hua (October 2024, ACM)

Full Text Available
SynTraC: A Synthetic Dataset for Traffic Signal Control from Traffic Monitoring Cameras

https://doi.org/10.1109/ITSC58415.2024.10920204

Chen, Tiejin; Shirke, Prithvi; Chakravarthi, Bharatesh; Vaghela, Arpitsinh; Da, Longchao; Lu, Duo; Yang, Yezhou; Wei, Hua (September 2024, IEEE)

Full Text Available
Prompt to Transfer: Sim-to-Real Transfer for Traffic Signal Control with Prompt Learning

https://doi.org/10.1609/aaai.v38i1.27758

Da, Longchao; Gao, Minquan; Mei, Hao; Wei, Hua (March 2024, Proceedings of the AAAI Conference on Artificial Intelligence)

Numerous solutions are proposed for the Traffic Signal Control (TSC) tasks aiming to provide efficient transportation and alleviate traffic congestion. Recently, promising results have been attained by Reinforcement Learning (RL) methods through trial and error in simulators, bringing confidence in solving cities' congestion problems. However, performance gaps still exist when simulator-trained policies are deployed to the real world. This issue is mainly introduced by the system dynamic difference between the training simulators and the real-world environments. In this work, we leverage the knowledge of Large Language Models (LLMs) to understand and profile the system dynamics by a prompt-based grounded action transformation to bridge the performance gap. Specifically, this paper exploits the pre-trained LLM's inference ability to understand how traffic dynamics change with weather conditions, traffic states, and road types. Being aware of the changes, the policies' action is taken and grounded based on realistic dynamics, thus helping the agent learn a more realistic policy. We conduct experiments on four different scenarios to show the effectiveness of the proposed PromptGAT's ability to mitigate the performance gap of reinforcement learning from simulation to reality (sim-to-real).
Full Text Available
Probabilistic Offline Policy Ranking with Approximate Bayesian Computation

https://doi.org/10.1609/aaai.v38i18.30019

Da, Longchao; Jenkins, Porter; Schwantes, Trevor; Dotson, Jeffrey; Wei, Hua (March 2024, Proceedings of the AAAI Conference on Artificial Intelligence)

In practice, it is essential to compare and rank candidate policies offline before real-world deployment for safety and reliability. Prior work seeks to solve this offline policy ranking (OPR) problem through value-based methods, such as Off-policy evaluation (OPE). However, they fail to analyze special case performance (e.g., worst or best cases), due to the lack of holistic characterization of policies’ performance. It is even more difficult to estimate precise policy values when the reward is not fully accessible under sparse settings. In this paper, we present Probabilistic Offline Policy Ranking (POPR), a framework to address OPR problems by leveraging expert data to characterize the probability of a candidate policy behaving like experts, and approximating its entire performance posterior distribution to help with ranking. POPR does not rely on value estimation, and the derived performance posterior can be used to distinguish candidates in worst-, best-, and average-cases. To estimate the posterior, we propose POPR-EABC, an Energy-based Approximate Bayesian Computation (ABC) method conducting likelihood-free inference. POPR-EABC reduces the heuristic nature of ABC by a smooth energy function, and improves the sampling efficiency by a pseudo-likelihood. We empirically demonstrate that POPR-EABC is adequate for evaluating policies in both discrete and continuous action spaces across various experiment environments, and facilitates probabilistic comparisons of candidate policies before deployment.
Full Text Available
Open-ti: open traffic intelligence with augmented language model

https://doi.org/10.1007/s13042-024-02190-8

Da, Longchao; Liou, Kuanru; Chen, Tiejin; Zhou, Xuesong; Luo, Xiangyong; Yang, Yezhou; Wei, Hua (May 2024, International Journal of Machine Learning and Cybernetics)

Full Text Available
Uncertainty-Aware Grounded Action Transformation Towards Sim-to-Real Transfer for Traffic Signal Control

https://doi.org/10.1109/CDC49753.2023.10383645

Da, Longchao; Mei, Hao; Sharma, Romir; Wei, Hua (December 2023, IEEE)

Full Text Available
Libsignal: an open library for traffic signal control

https://doi.org/10.1007/s10994-023-06412-y

Mei, Hao; Lei, Xiaoliang; Da, Longchao; Shi, Bin; Wei, Hua (November 2023, Machine Learning)

This paper introduces a library for cross-simulator comparison of reinforcement learning models in traffic signal control tasks. This library is developed to implement recent state-of-the-art reinforcement learning models with extensible interfaces and unified cross-simulator evaluation metrics. It supports commonly-used simulators in traffic signal control tasks, including Simulation of Urban MObility (SUMO) and CityFlow, and multiple benchmark datasets for fair comparisons. We conducted experiments to validate our implementation of the models and to calibrate the simulators so that the experiments from one simulator could be referential to the other. Based on the validated models and calibrated environments, this paper compares and reports the performance of current state-of-the-art RL algorithms across different datasets and simulators. This is the first time that these methods have been compared fairly under the same datasets with different simulators.
Full Text Available